A semi-supervised cluster-and-label approach for utterance classification
نویسندگان
چکیده
In this paper, we propose a semi-supervised cluster-and-label algorithm for utterance classification. The approach assumes that the underlying class distribution is roughly captured through– fully unsupervised–clustering. Then, a minimum number of labeled examples is used to automatically label the extracted clusters so that the initial label set is ”augmented” to the whole clustered data. The optimum cluster labeling is achieved by means of the Hungarian algorithm, traditionally used to solve optimization assignment problems. Finally, the augmented labeled set is applied to train an SVM classifier. We compare this semi-supervised approach to a fully supervised version in which the initial labeled sets are directly used to train the SVMmodel.
منابع مشابه
Semi-Supervised Learning Based Prediction of Musculoskeletal Disorder Risk
This study explores a semi-supervised classification approach using random forest as a base classifier to classify the low-back disorders (LBDs) risk associated with the industrial jobs. Semi-supervised classification approach uses unlabeled data together with the small number of labelled data to create a better classifier. The results obtained by the proposed approach are compared with those o...
متن کاملGraph Based Semi-supervised Learning in Computer Vision
OF THE DISSERTATION Graph Based Semi-Supervised Learning in Computer Vision by Ning Huang Dissertation Director: Joseph Wilder Machine learning from previous examples or knowledge is a key element in many image processing and pattern recognition tasks, e.g. clustering, segmentation, stereo matching, optical flow, tracking and object recognition. Acquiring that knowledge frequently requires huma...
متن کاملLabel Propagation for Semi-Supervised Learning in Self-Organizing Maps
Semi-supervised learning aims at discovering spatial structures in high-dimensional input spaces when insufficient background information about clusters is available. A particulary interesting approach is based on propagation of class labels through proximity graphs. The Emergent Self-Organizing Map (ESOM) itself can be seen as such a proximity graph that is suitable for label propagation. It t...
متن کاملMulti Label Spatial Semi Supervised Classification using Spatial Associative Rule Mining and Evolutionary Algorithms
Multi-label spatial classification based on association rules with multi objective genetic algorithms (MOGA) enriched by semi supervised learning is proposed in this paper. It is to deal with multiple class labels problem. In this paper we adapt problem transformation for the multi label classification. We use hybrid evolutionary algorithm for the optimization in the generation of spatial assoc...
متن کاملSemi-Supervised Classification Based on Classification from Positive and Unlabeled Data
Most of the semi-supervised classification methods developed so far use unlabeled data for regularization purposes under particular distributional assumptions such as the cluster assumption. In contrast, recently developed methods of classification from positive and unlabeled data (PU classification) use unlabeled data for risk evaluation, i.e., label information is directly extracted from unla...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010